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Rare event statistics for random walks on complex networks are investigated using the large deviations for¬ 
malism. Within this formalism, rare events are realized as typical events in a suitably deformed path-ensemble, 
and their statistics can be studied in terms of spectral properties of a deformed Markov transition matrix. We 
observe two different types of phase transition in such systems: (i) rare events which are singled out for suffi¬ 
ciently large values of the deformation parameter may correspond to localized modes of the deformed transition 
matrix; (ii) “mode-switching transitions” may occur as the deformation parameter is varied. Details depend on 
the nature of the observable for which the rare event statistics is studied, as well as on the underlying graph 
ensemble. In the present letter we report on the statistics of the average degree of the nodes visited along a 
random walk trajectory in Erdos-Renyi networks. Large deviations rate functions and localization properties are 
studied numerically. For observables of the type considered here, we also derive an analytical approximation 
for the Legendre transform of the large-deviations rate function, which is valid in the large connectivity limit. It 
is found to agree well with simulations. 


Random walks are dynamical processes widely used to an¬ 
alyze, organize or perform important tasks on networks such 
as searches 011 , routing or data transport M. Their pop¬ 
ularity is due to their cheap implementation, as they rely only 
on local information, such as the state of the neighborhood 
of a given node of the network. This ensures network scala¬ 
bility and allows fast data transmission without the need for 
large storage facilities at nodes, such as big routing tables in 
communication networks. These features make random walks 
an efficient tool to explore networks characterized by a high 
cost of information. Examples are sensor networks |6|] where 
many signaling packets are needed to acquire wider networks 
status information. In peer-to-peer networks the absence of a 
central server storing file locations requires users to perform 
repeated local searches in order to find a file to download, and 
various random walk strategies have been proposed as a scal¬ 
able method M in this context. Less attention has been paid 
to characterize rare events associated with random walks on 
networks. Yet the occurrence of a rare event can have severe 
consequences. In hide-and-seek games for instance j loh . rare 
events represent situations where the seeker finds either most 
(or unusually many) of the hidden targets, or conversely none 
(or unusually few). In the context of cyber-security, where 
one is concerned with worms and viruses performing ran¬ 
dom walks through a network, a rare event would correspond 
to a situation where unusually many sensible nodes are suc¬ 
cessfully attacked and infected, which may have catastrophic 
consequences for the integrity of an entire IT infrastructure. 
Characterizing the statistics of rare events for random walks 
in complex networks and its dependence on network topology 
is thus a problem of considerable technological importance. A 
variant of this problem was recently analyzed for biased ran¬ 
dom walks in complex networks 111 ]. That paper addressed 
rare fluctuations in single node occupancy for an ensemble 
of independent (biased) walkers in the stationary state of the 


system. By contrast, our interest here is in rare event statis¬ 
tics of path averages, or equivalently of time integrated vari¬ 
ables. Rare event statistics of this type has been looked at for 
instance in the context of kinetically constrained models of 
glassy relaxation [ 12}]; relations to constrained ensembles of 
trajectories were explored in Cl for Glauber dynamics in the 
Id Ising chain. While these studies were primarily concerned 
with the use of large deviations theory as a tool to explore dy¬ 
namical phase transitions in homogeneous systems, our focus 
here is on the interplay between rare event statistics and the 
heterogeneity of the underlying system. 


In the present Letter we use large deviations theory to study 
rare events statistics for path averages of observables asso¬ 
ciated with sites visited along trajectories of random walks. 
Within this formalism, rare events are realized as typical 
events in a suitably deformed path-ensemble ii. Then- 
statistics can be studied in terms of spectral properties of a de¬ 
formed version of the Markov transition matrix for the orig¬ 
inal random walk model, the relevant information being ex¬ 
tracted from the algebraically largest eigenvalue of the de¬ 
formed transition matrix. Such deformation may direct ran¬ 
dom walks to subsets of a network with vertices of either 
atypically high or atypically low coordination. It also ampli¬ 
fies the heterogeneity of transition matrix elements for large 
values of the deformation parameter and we observe that, as 
a consequence, the eigenvector corresponding to the largest 
eigenvalue of the deformed transition matrix may exhibit a lo¬ 
calization transition, indicating that rare large fluctuations of 
path averages are typically realized by trajectories that remain 
localized on small subsets of the network. Within localized 
phases, we also encounter a second type of dynamical phase 
transition related to switching between modes as the defor¬ 
mation parameter used to select rare events is varied. Our 
methods allow us to study the role that network topology and 
heterogeneity play in selecting these special paths, as well as 
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to infer properties of paths actually selected to realize extreme 
events. 

The model. We consider a complex network with adja¬ 
cency matrix A, with entries a.ij — 1 if the edge (ij) exists, 
aij = 0 otherwise. The transition matrix W of an unbiased 
random walk has entries W,, = a,y/k/ where kj is the degree 
of node j and IV ( / is the probability of a transition from j to i. 

Writing if = O'o, i\, • ■ • , if) a path of length l, quantities of 
interest are empirical path-averages of the form 


^ = a) 

r=i 

where the are quenched random variables associated with 
the vertices i = 1,... ,N of the graph, which could be inde¬ 
pendent of, be correlated with, or be deterministic functions 
of the degrees k; of the vertices. It is expected that the ff are 
for large t sharply peaked about their mean 
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where P(i/) denotes the probability of the path i/. 

The average Q can be obtained from the cumulant generat¬ 
ing function f e (s) = In J] if P {as f ( = ^'(s)U 0 - 
Here, we are interested in rare events, for which the empiri¬ 
cal averages ff take values f which differ significantly from 
their mean ff. Large deviations theory predicts that for f. » 1 
the probability density P(f) for such an event scales exponen¬ 
tially with path-length i, P(f) ~ e~ f/(l ^, with a rate function 
1(f) which, according to the Gartner-Ellis theorem 114] is ob¬ 
tained as a Legendre transform 1(f) = sup s (s0 - f(s)} of the 
limiting cumulant generating function f(s) = limf_»oo ft(s), 
provided that this limit exists and that it is differentiable. We 
shall see that the second condition may be violated, and that 
the derivative f(s) may develop discontinuities at certain s- 
values, entailing that we observe regions where 1(f) is strictly 
linear and only represents the convex hull of the true rate func¬ 
tion |l_4j. 

In order to evaluate fe(s), we express path probabilities us¬ 
ing the Markov transition matrix W and a distribution po = 
(PoO'o)) of initial conditions as P(i { ) = | Ilf=i W llhl ]p(i 0 ), 
entailing that fc(s) can be evaluated in terms of a de¬ 
formed transition matrix W(s) = (e 5 ^ W,j) as fc(s) = 

t~ x In Z(-,,i 0 [W /f (‘ v )]ifi 0 p(io)- Using a spectral decomposition of 
the deformed transition matrix one can write this as 


f{(s) = lnTi + - In 


(l,vi)(wi,p 0 ) + 2 1^-) (l,v«)(w tt ,p 0 ) 




(3) 

Here the A a are eigenvalues of W(s), the and w„. are the 
corresponding right and left eigenvectors, 1 = ( 1 ,..., 1 ), and 
the bracket notation (•, •) is used to denote an inner prod¬ 
uct. Eigenvalues are taken to be sorted in decreasing order 
A\ > |/L| > IT 3 I • ■ ■ > An, with the first inequality being a 


consequence of the Perron-Frobenius theorem |15]. This con¬ 
cludes the general framework. For the remainder of this Let¬ 
ter, we will restrict our attention to the case where f = f(k,). 

For long paths, the value of the cumulant generating func¬ 
tion is dominated by the leading eigenvalue 4| = /L(s) of the 
transition matrix W(s), so f(s) = log/li(s). In the s - 0 case, 
the eigenvalue problem is trivial, as the column-stochasticity 
of the transition matrix yields a left eigenvector w, = 1 corre¬ 
sponding to the maximal eigenvalue Ai = 1. The associated 
right eigenvector is v,- oc k,. For nonzero s, such closed form 
expressions are in general not known. Performing a direct 
matrix diagonalization is quite daunting for large system sizes 
N, even if one exploits methods that calculates only the first 
eigenvalue [ 16j]- Hence we are interested in fast viable approx¬ 
imations. Here we describe one such approximation expected 
to be valid for networks in which vertex degrees are typically 
large. 

Degree-based approximation. We start by considering the 
left eigenvectors w instead of the right eigenvectors, for which 
the eigenvalue equation can be written as 


Awj= yX 


Wit 


s f(kj) 


(4) 
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This system of equations can be simplified by considering 
a degree-based approximation for the first eigenvector, where 
one assumes that the values of w,- only depend on the degree 
of the node i: w, = w(k t ). If the average degree is large enough 
and the degree distribution is not too heterogeneous, we can 
write the eigenvalue equation © by appeal to the law of large 
numbers as 

Ti(s) w(k) = ^ P(k'\k) w(k') e sf(k,) (5) 

k’ 


where P(k'\k) is the probability for the neighbor of a node of 
degree k to have degree k'. 


In an Erdos-Renyi (ER) ensemble (1711 . and more gener¬ 
ally in any configuration model ensemble, we have P(k'\k) = 
P(k')jpj. In this case the right-hand side of (© does not depend 
on k and the w(k) are in fact k-independent. The eigenvalue 
equation then simplifies to 
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where the average is over the degree distribution P(k). This 
approximation yields excellent results for large mean connec¬ 
tivities c = (k) on ER graphs, and more generally for configu¬ 
ration models without low degree nodes. This is illustrated in 
figureQ] where we plot a comparison with numerical simula¬ 
tions for ER graphs with c = 30. In figure Q] and throughout 
the remainder of the paper simulation results are obtained as 
averages over 1000 samples. 

Eigenvector localization. Because of the heterogeneity of 
the underlying system, one finds the random walk transition 
matrix to exhibit localized states, both for fast and slow relax¬ 
ation modes 01 , even in the undeformed system, although 
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FIG. 1. (Colour online) Cumulant generating function if/(s ) for ER 
networks with c = 30 and /(£,) = kjc, comparing the large-degree 
approximation © (blue line) with results of a numerical simulation 
(green line). The inset shows the corresponding rate functions. 


the eigenvector corresponding to the largest eigenvalue (the 
equilibrium distribution) is typically delocalized. However, 
given the nature of the deformed transition matrix, one expects 
the deformed random walk for large |.v| to be localized around 
vertices where sf{ki) is very large; hence we anticipate that in 
the deformed system, even the eigenvector corresponding to 
the largest eigenvalue may become localized for sufficiently 
large |s|. In order to investigate this effect quantitatively we 
look at the inverse participation ratio of the eigenvector corre¬ 
sponding to the largest eigenvalue A\ of W(s). Denoting by v, 
its /-th component, we have 

V 1 A 

IPR[v] = -=^-2 (7) 

[Z/'f] 

One expects IPR[v] ~ N for a delocalized vector, whereas 
IPR[v] = (5(1) if v is localized. 

Results on random graphs. We performed numerical sim¬ 
ulations to evaluate T (,y) and the IPR[vi(s)] for several types 
of network, defined by their random graph topology. In the 
present letter we restrict ourselves to discussing results for 
ER networks. We found that other network ensembles such 
as scale-free random graphs give qualitatively similar results; 
we will report on these in an extended version of this letter. 

We looked at various examples for the function /(£,) but 
in the present letter we only report results for the normal¬ 
ized degree f(ki) = k,/c; other deterministic types of degree- 
dependent functions exhibit similar behavior, thus focusing 
on the normalized degree is sufficient to capture the impor¬ 
tant aspects of this problem. We restrict our simulations to the 
largest (giant) component of the graphs, in order to prevent 
spurious effects of isolated nodes or small disconnected clus¬ 
ters (e.g. dimers) dominating Ti (s) and the IPR for negative s, 
as these would represent trivial instances of rare events, where 
a walker starts, and is thus stuck on a small disconnected com¬ 
ponent of the graph. From here on, the network size given 


must be understood as the size of the networks from which 
the giant component is extracted. 

Fig. [2]shows the existence of two localized regimes for suf¬ 
ficiently large values of |s|, with IPRs on the localized side of 
both transitions increasing with system size. Results can be 
understood, as for large |s| the deformed random walk is nat¬ 
urally attracted to the nodes with the largest (resp. smallest) 
degrees for positive (resp. negative) s. Thus for large nega¬ 
tive s the deformed walk tends to be concentrated at the end 
of the longest dangling chain, whereas for large positive s it 
will be concentrated at the site with the largest available co¬ 
ordination. On an ER network where the large-degree tail of 
the degree distribution decays very fast, such a high degree 
vertex is likely to be connected to vertices whose degrees are 
lower, even significantly lower, than that of the highest degree 
vertex in the network, which leads to IPRs approaching 1 in 
the large N limit. Conversely, for negative ,v, the deformed 
random walk will be attracted to the ends of dangling chains 
in the network, with the probability of escape from a chain 
decreasing with its length (with the length of the longest dan¬ 
gling chain increasing with system size). This can explain 
that IPRs initially saturate at 1/2 for large systems. Only upon 
further decreasing s to more negative values will the asym¬ 
metry of the deformed transition matrices, to and away from 
the end of a dangling chain, induce that further weight of the 
dominant eigenvector to become concentrated on the end-site, 
leading to a further increase of the IPR. 
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FIG. 2. IPR[v] as a function of the deformation parameter .y for ER 
graphs with c = 6, and /(£,) = k,/c. The inset exhibits the N~ l - 
scaling of IPRs for 4 different values of the deformation parameter s, 
chosen in pairs on either side of two localization transitions, one at 
negative, and one at positive i. 

From the values of /l|(,v) we also derived the large devia¬ 
tion rate functions for path averages of the normalized degree 
f(kj) = kj/c, for various systems sizes and average connec¬ 
tivities. In fig. © we report /(</>) for an ER network at a low 
connectivity of c = 3. While the right branch of 7(0) is for 
large N well approximated by a parabola, our results show 
the emergence of a linear region on the left branch, which 
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becomes more pronounced as the system size is increased. 
This is a signature of a non-difFerentiable point of >J/(s) at 
a point 5 * estimated to be at s* = -0.060 ± 0.002: at this 
point the Gartner-Ellis theorem cannot be used to evaluate the 
rate function, and the linear branch only represents the con¬ 
vex envelope of the true /(</;) [14]. The latter can either coin¬ 
cide with its convex envelope, or it can indeed be non-convex. 
However this information cannot be accessed by the theorem. 
The emergence of a jump-discontinuity in f(s) is due to a 
level crossing of the two largest eigenvalues, where the system 
switches between two modes that correspond to the largest 
eigenvalue on either side of s*. In finite systems the crossing 
is an ‘avoided crossing’ due to level repulsion, but the two 
largest eigenvalues become asymptotically degenerate at s * in 
the IV —> oo limit, leading to a divergence of the correlation 
length f(s) = [ln(/fi(5)//l2(s)] _1 at s*, in close analogy with 
phenomenology of second order phase transitions, the diver¬ 
gence being logarithmic in N in the present case. 



<t> 


FIG. 3. Rate function 1(f) for ER graphs with c = 3, and f(k ,) = kfc 
for system sizes ranging from N = 100 to N = 6400. In the inset, 
we show i f/(s) in the vicinity of the non-difFerentiable point. For the 
largest system size, a linear fit of the convex envelope of the left 
branch and a quadratic fit of the right branch of 1(f) are shown as 
well. 

Conclusions and future perspectives. In this Letter we 
have analyzed rare events statistics for path averages of ob¬ 
servables associated with sites visited along random walk tra¬ 
jectories on complex networks. Results are obtained by look¬ 
ing at spectral properties of suitably deformed transition ma¬ 
trices. The main outcome of our analysis is the possible emer¬ 
gence of two types of dynamical phase transitions in low mean 
degree systems: localization transitions which entail that large 
deviations from typical values of path averages may be real¬ 
ized by localized modes of a deformed transition matrix, and 
mode-switching transitions signifying that the modes (eigen¬ 
vectors) in terms of which large deviations are typically real¬ 
ized may switch as the deformation parameter s and thus the 
actual scale of large deviations are varied. Results of numer¬ 
ical simulations consistently support these claims. We also 
developed an analytical approximation valid for networks in 


which degrees are typically large. 

Our work opens up the perspective to study a broad range of 
further interesting problems. On a technical level, one would 
want to implement more powerful techniques, such as derived 
in 0. to obtain the largest eigenvalue in the present problem 
class for larger system sizes. Then there is clearly the need 
to systematically study the dependence of the phenomena re¬ 
ported here on the degree statistics, and on the nature of the 
observables for which path averages are looked at. We have 
gone some way in this direction and will report results in an 
extended version of the present paper. In particular one might 
wish to look at observables which, rather then being determin¬ 
istic functions of the degree, are only statistically correlated 
with the degree, or at observables taking values on edges be¬ 
tween nodes [13t 14fl. This could be of interest in applications 
such as traffic or information flows on networks subject to ca¬ 
pacity constraints on edges. Moreover, given the nature of the 
mode-switching transition observed in the present letter, it is 
clearly conceivable that several such transitions could be ob¬ 
served in a single system, depending of course on the nature 
of the observables studied and on the topological properties of 
the underlying networks. Finally, critical phenomena associ¬ 
ated with the localization transition and with mode-switching 
transitions also deserve further study. We believe that this list 
could go on. 

This work was supported by the Marie Curie Training Net¬ 
work NETADIS (FP7, grant 290038). 
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